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O [ Abstract For any system, whether physical or non-physical, knowledge of the form and strength 
^ \ of inter-individual interactions is a key-information. In an approach based on statistical physics 
one needs to know the interaction in order to write the Hamiltonian of the system: H = -£ff ree + 
' -^interaction- ^or non-physical systems, based on qualitative arguments similar to those used in 
physical chemistry, interaction strength gives useful clues about the macroscopic properties of the 
system (e.g. for an institution the dropout rate is expected to be smaller when the inter-individual 
SpH| attraction is stronger). 

Even though our ultimate objective is the understanding of social phenomena, we found that sys- 
terns composed of insects (or other living organisms) are of great convenience for investigating group 
c/3 ■ effects. In this paper we show how to design experiments that enable us to estimate the strength of in- 
teraction in groups of insects. By repeating the same experiments with increasing numbers of insects, 
ranging from less than 10 to several hundreds, one is able to explore key-properties of the interaction. 
The data turn out to be consistent with a global correlation that is independent of distance (at least 
within a range of a few centimeters). Estimates of this average cross-correlation will be given for 
ants, beetles and fruit flies. The experimental results clearly exclude an Ising-like interaction, that is 
to say one that would be restricted to nearest neighbors. In the case of fruit flies the average cross- 
correlation appears to be negative which means that instead of an inter-individual attraction there is a 
£T) \ (weak) repulsive effect. 

In our conclusion we insist on the fact that such "physics-like experiments" on insect populations 
provide a valuable alternative to computer simulations. When testable group effects are predicted by 
a model, the required experiments can be set up within a short time, thus permitting to confirm or dis- 
| prove the model. This marks a significant progress with respect to modeling of social systems where, 
all too often, the requested statistical data just do not exist, thus obstructing any fruitful dialogue 
^ ■ between theory and observation. 
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In a first version of the paper the title made reference to "statistical physics" rather 
than to "physical chemistry". Although chemistry itself plays no role in our investi- 
gation, we think it is important to emphasize that at this point it relies rather on the 



By using the theoretical framework of statistical mechanics one can derive the macro- 
scopic properties of a system from the characteristics of its microscopic elements. 
This is a major achievement and so it is hardly surprising that researchers from other 
disciplines (e.g. biology, demography, sociology or economics) have been tempted 
to adapt such a powerful tool to their own field. In light of the successful record 
of statistical mechanics in physics there is little doubt that such extensions appear 
highly desirable. Yet, to our best knowledge, in spite of many attempts in this direc- 
tion such attempts have not been highly successful so fafl 

Obstacles 

As a matter of fact this is hardly surprising for there are indeed many obstacles. 

• Statistical physics is fundamentally a theory of systems in equilibrium. For 
systems which are (strongly) out of equilibrium the very concept of temperature 
becomes meaningless. 

• Statistical physics relies on the identification of ensemble averages (which are 
predicted theoretically) and time-averages (which are measured in experiments). 
This so-called ergodic hypothesis may be valid for physical systems which move 
from one state to another every picosecond so that there are trillions of transitions 
during an observation time of a few seconds. Yet, it is not obvious that such an as- 
sumption can still be accepted for socio-economic systems for which the transition 
rates are much sloweiR 



• Last but not least, one should not forget that in order to use the theoretical frame- 
work of statistical mechanics one needs to know the Hamiltonian H of the system 
which indicates how energy is distributed in the system. Generally H includes three 
parts: 



A preliminary but extended version (some 100 pages) of the present paper is available on the following website: 
http://www.lpthe.jussieu.fr/^roehner/effusion.pdf 



2 Recently, some promising breakthroughs were made in this direction by Japanese economists; see Aoki and 
Yoshikawa (2007), Iyetomi et al. (201 1), Iyetomi (2012). 

3 The highest transition rates are probably those in currency exchange markets with hundreds of orders (worldwide) 
every second. Recently so-called high speed trading, that is to say transaction orders passed by computers, has reduced 
transition times to a few micro-seconds at least for a number of actively traded securities. 




Rationale and motivations 



H = H Q + H; 



inter 




3 



where Hq stands for the free particles, #j n t er for the interaction energy between 
them and i^exo for the energy of the particles when an exogenous field is involved. 
For instance Hq ~ £ v\l m f° r a system containing the molecules of a gas, ~ 
£ /■ _ x n 6 when one wants to take into account the van der Waals forces between the 
molecules, and i^exo ~ T,SjH(i) for the energy of a set of spins in an external 
magnetic field. 

Whereas the third term can possibly be omitted when the experimental device can 
be shielded from external fields, the interaction term must always be taken into ac- 
count^. Needless to say, there are almost no biological or social systems for which 
one has a clear knowledge of their interactions. It is precisely the main purpose of 
the present paper to explain how such interactions can be measured. 

Reasons for optimism 

The previous list of obstacles could appear discouraging especially if one realizes 
that there are many other problems in non-physical systems just for defining key- 
variables such as velocity or energy. However, there are also good reasons for opti- 
mism as we will see now. 

First it can be observed that the theory of phase transitions has been used to describe 
the transition between ordinary hadronic matter and quark- gluon plasma. As such 
states are characterized by temperature of the order of 10 12 K and life-times of the 
order of lCT 20 s, it means that this theory is applied well beyond the limits of the 
phenomena^ for which it was originally developed. Does the ergodic assumption 
hold for such extremely short time intervals? Nobody knows and probably nobody 
cares. The strategy of physicists is to use this framework without giving too much 
concern to underlying assumptions. If sensible results emerge this will provide so to 
say ex post justification. 

Secondly, it can be observed that the title of this paper does not refer to statistical 
physics but to physical chemistry. Why? 

• Although the objective of physical chemistry is also to explain the properties of 
macroscopic systems in terms of molecular interactions, there are two main differ- 
ences with the approach of statistical mechanics. First, physical chemistry considers 
a broad range of molecules rather than just the simplest ones as is done in physics. 
Thus, because many cases are being considered, it becomes indispensable to adopt a 
comparative perspective. Why is the melting point of argon lower than the melting 
point of water? Why is the equilibrium vapor pressure higher for ethanol than for 
water? And so on and so forth. 



4 Even in order to use a mean field approximation one must know the form of if m t er 

5 E.g. second order phase transitions such as the paramagnetic-ferromagnetic transition in iron. 
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• Because it would be an almost impossible task to propose (and solve) full- 
fledged models for all these cases, physical chemistry will rather resort to qualitative 
arguments. For instance, a standard argument is to observe that the stronger are 
molecular interactions in a liquid, the fewer molecules will be able to escape which 
in turn will lead to a low equilibrium vapor pressure above the liquid. Whereas this 
argument relies on a specific mechanism describing how molecules leave the liquid, 
it does not require any of the assumptions that we listed in previously. Even equilib- 
rium is not strictly required. Indeed, if the container is left open, no equilibrium will 
take place and no equilibrium vapor pressure can be defined, but the same argument 
can nevertheless be used for explaining differences in the evaporation rate. 

Such kind of argument can be used with success to explain many physical properties. 
For instance, the boiling temperature of alkanes (C n H2 n+ 2) is expected to increase 
with n because the so-called London attraction forces (due to induced polarization 
which create short-lived dipoles that attract one another) exist between all atoms 
and therefore, in the absence of any other force, attraction will be stronger for big 
molecules than for small ones. Through a similar argument one would also expect the 
heat of vaporization to increase with n. These predictions are indeed confirmed by 
experimental data; two graphs displaying such data can be found in Roehner (2004, 
p. 663). 

In short, once one knows the strength of interaction in a system, one should be able 
to derive several of its macroscopic properties. Thus, we are again confronted to 
the same key-question: how can we measure interaction strengths? To answer this 
question we will make yet another simplification. 

(3) The simplifications that we have already made consisted firstly in saying that 
we do not need to care too much about the underlying hypotheses of statistical 
physics, secondly that (at least in a first stage) there is no need to use the math- 
ematical framework of statistical mechanics. Now our objective has become less 
ambitious and the only question on which one needs to focus is to develop experi- 
mental ways for measuring coupling strength between the elements of the system. 
The word experimental leads us to a third simplification. 

It is often said that for socio-economic systems one cannot make experiments^. How- 
ever, this is only partially true. In fact, social sciences researchers are in the same 
position as astrophysicists. While they cannot perform any observation that they 

6 In the discussion which follows we leave apart so-called class-room experiments that are performed with small groups 
of students. Such experiments can be useful to study how people will react in specific circumstances such as in response to 
auction rules for instance. However, one does not see how collective behavior can truly be studied in such a way because 
the experiment will only reflect genuine behavior if the people are not told that they are involved in an experiment. In 
the 1970s and 1980s the psycho-sociologist Stanley Milgram has performed experiments of this kind. However, such an 
approach raises major ethical problems and should rather be avoided. 



would like to do, nevertheless they can use such statistical data that are available 
to make a limited number of observations^. Yet, one must recognize that in many 
investigations the very data that one would need turn out to be unavailable. This is 
a serious obstacle. The task of designing appropriate measurement methods is diffi- 
cult enough in itself; it would become altogether impossible if at each step progress 
is hindered by a lack of data. 

There is a simple solution. Instead of studying people we can study populations of 
living organisms such as bacteria, insects or small fishes. For all these populations 
there exists a broad range of species. Different species will have different inter- 
individual interactions. Thus, one is very much in the same position as in physical 
chemistry. In what follows we will limit ourselves to populations of insects. 

Our goal is to study groups of insects not at all as an entomologist would do but from 
the perspective of physical chemistry. In this respect living organisms have another 
important advantage over social or economic systems. Energy is a key-notion in 
physics. While it is not obvious how to define the "energy" of a set of stocks or a 
sample of companies, it is easy to define the velocity and kinetic energy of a group 
of ants. In other words, systems of living organisms are much closer to physical 
systems than are socio-economic systems. 

In the next section we explain how we designed and implemented our experiments. 
In the last section we propose some consistency tests of our results. 



Design of the experiments 

The experiment will be described for ants but their design is fairly similar for other 
insects such as fruit flies or beetles. 

A number n of ants are contained in a rectangular box (15cm long and 5cm wide, 
4mm high) (see Fig. la). In this box one defines two part: an area A and the part B 
of the box which does not belong to A. For the sake of simplicity we can think of A 
as being the left-hand side of the box as is the case in Fig. la. However, one should 
keep in mind that A can also be much smaller than one half of the box. This allows 
to explore the behavior of the ants at smaller scale. 

The ants can choose the compartment in which they wish to go or to stay. We record 
the number n^(t) of ants which are in compartment A at time t. 

The idea of the experiment is the following. 



'Researchers who have appropriate funding can even organize surveys in order to collect data that would not be 
available otherwise. 
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• Suppose for a moment that the movements of the ants are completely correlated. 
This means that if one ant goes from A to B (or from B to A) all the others will 
follow. Thus at each time step riA(t) may experience huge jumps, either from N to 
or vice versa. 

• Suppose now that there is a zero correlation between the movements of the ants. 
This means that if one ant goes from A to B it will not be imitated by others. Of 
course, other ants may make the same move but they will do so independently from 
one another. As a result their moves will follow a binomial process. A move of all 
the ants together is not completely excluded but it will occur with a probability of 
(1/2) N and decrease exponentially when N increases. 



Beginning of drift to the right 
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Fig. la (left): Experiments with ants in a two-compartment device. Fig. lb (right): Simulation with an inter- 
individual cross-correlation equal to 0.2. 

This argument suggests that there is a connection between the standard deviation of 
riA (t) and the average correlation between the movements of the ants. Needless to 
say, we wish to know the mathematical form of this relationship. Then, by recording 
the fluctuations of n^(t) we will be able to compute its variance and to derive the 
average correlation between ants. This average correlation can be considered as a 
measure of their interaction strength. 



Formalization 

To each ant i we associate a random variable X{ which takes the value 1 when % 
is in compartment A and otherwise. Thus, at any moment t, the number of ants 
in compartment A will be given by: S n = £™ X\. If riA(t) is a stationary random 
function, it is reasonable^ to assume that the variance computed from the time series 
riA (t) coincides with the probabilistic (i.e. ensemble) variance of the random variable 

Various assumptions can be made regarding inter-individual interaction. Each as- 



8 While of course necessary, the stationarity condition is not sufficient to guaranty ergodicity of the standard deviation. 
The specific mathematical condition that nA(t) must satisfy is given in Papoulis (1965, p. 330). 
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sumption leads to different correlations between the X{. We will examine two spe- 
cific cases: uniform correlation which means that = r(X{,Xj) is basically the 
same for all pairs and correlations which decrease exponentially when the dif- 



Uniform correlation 

In this case, a 2 (S n ) = cr 2 (n^) is given by the following proposition. 

Variance of a sum of uniformly correlated variables. We consider a sum S n 
of n identically distributed random variables Xi of variance a 2 . We assume that 
between Xi,Xj,i ^ j there are cross-correlations r^, the average of which is is 
denoted by r: r = ^ n \y 2 ] ^T<j Ti .r Th en, the variance of S n = Xi + . . . + X n 
is given by: 



The proof is fairly straightforward and is given in Appendix A. 

Four observations are of interest in relation with formula (1). 

(1) The factor no 2 represents the variance of S n when the variables are uncorre- 
cted. Therefore the ratio on the left-hand side represents the variance of S n divided 
by what it would be if the correlations are switched off. Subsequently this ratio will 
be denoted by g 2 . 

(2) In the special case where r = 1, formula (1) gives: a 2 (S n ) = n 2 a 2 . This 
result can be confirmed by observing that r = 1 means that all variables X{ are 
identical that is to say take the same values (with probability 1). Thus, S n = nX\ =4> 



(3) A negative average correlation reduces the variance instead of increasing it. 
This would correspond to a repulsive force between the individuals. It is of interest 
to observe that r cannot become smaller than — l/(n — 1). In this case the variance 
is reduced to zero. Intuitively, this corresponds to a situation where the move of 
any individual is countered by the moves of the others in a way which leaves S n 
unchanged. 

(4) Formula (1) applies to any random variables X^ For the problem in which we 
are interested, the X{ have a special meaning from which results that: 



o*(Xi) = E{X 2 ) - E 2 (X l ) = P{X, = 1}1 - (P{X, = 1}1) 2 = p(l-p) 



where p is the fraction of A with respect to the total area. 
Ising-like correlations 

When the interaction is restricted to nearest neighbors as in the one-dimensional 
Ising model for spins, the correlation between the Xi decreases exponentially when 



ference i — j increases: r 





(1) 



<J 2 (S n ) 



(7 2 (nXi) 
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the "distance" between the spins increases (Glauber 1963 p. 300). In this case the 

variance of S n is given by the following proposition. 

Variance of a sum of Ising-like correlated variables. We consider a sum S n 
of n identically distributed random variables X{ of variance a 2 . We assume that 



The proof is fairly similar to the proof of the first proposition and it is outlined in 
Appendix A. According to this result, the ratio g 2 (n) = is slightly increasing 
when n increases (see Appendix A). However, when n becomes large the term in- 
volving n becomes negligible with respect to the first term. Thus, it is legitimate to 
say that for large n, g 2 (n) is almost constant. 

Can one explain the difference between case 1 and 2 intuitively? We have already 
observed that if r is close to 1, almost all insects will cross from one side to the other 
at the same time which will result in big fluctuations of n^i) between and n. In 
the second model the parallel of such a high correlation would be 77 close to 1, e.g. 
77 = 0.9. Yet, even with such a value of 77 the correlation between i and its neighbors 
will fall off rapidly when the distance increases. This means that when i will change 
side, only a small number (/) of neighbors will follow. As / depends only upon 77 
(and not upon n) one sees that g 2 (n) does not increase with n. 

In short, for the models that we considered the ratio g 2 {n) can behave in three differ- 
ent ways as a function of n. 

(1) It decreases linearly when r < 

(2) It is almost constant when decreases exponentially with respect to \i — j\. 

(3) It increases linearly when r > 0. 

We will see that only cases 1 and 3 occur in our observations. 



Procedure 

The experimental procedure involves the following steps. 

• First one must spread n ants fairly uniformly in the whole container. Then 
pictures will be taken every 5 seconds 




Experimental results 



9 An "appropriate" time interval is important for the accuracy of the measurement. Of course, it is useless to take 
pictures when nothing happens that is to say when nA(t) does not change. On the other hand, simulations show that one 
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• These pictures will allow us to record the numbers riA(t). Once the variance of 
this time series has been computed one gets the ratio g 2 (n). 

• By repeating this procedure for different number of ants one gets results which 
can be represented as a set of points (n — 1, g 2 {nj) (see Fig. 2). 

• A linear regression performed on this set of points gives an estimate of the slope 

r. 

Results 

See the graphs in Fig. 2a,b,c. 

An important observation is in order regarding the magnitude of the estimated av- 
erage correlation. First, it must be emphasized that r is very different from the cor- 
relation estimated from a scatter-plot. In the latter case a correlation as low as 0.01 
would be non-significant (in the sense that the confidence interval would contain 0) 
except if the scatter-plot contains several thousand data points. Here, however, the 
correlation was obtained as the slope of a regression line and its estimate is quite 
significant as can be seen from the size of the error bars. 

In order to get an intuitive understanding of r, one should compare the actual trajec- 
tories of the insects to those shown in the simulation of Fig. lb. Broadly speaking, 
the comparison will reveal that at individual level the actual trajectories of the in- 
sects are even more random than those in Fig. lb. In spite of this high degree of 
randomness there is an observable effect at the macro level. The situation is some- 
what the same as for a gas or a liquid. In spite of the randomness of the movements 
of individual molecules there are nevertheless well defined macroscopic properties. 

Problems 

Although the procedure may appear fairly straightforward there are a number of 
hurdles; while some are purely technical others are of more fundamental importance. 
Let us begin with the latter. 

Ideally, in order to remain in a stationary equilibrium situation one would like riA(t) 
to fluctuate around 1/2. Actually, for ants as well as for beetles, riA(t) can become 
very different from 1/2. This is due to the fact that in such cases the individuals will 
form a big cluster in one part of the container. Thus, if the cluster is in A, the ratio 
n>A(t)/n will become close to 1, whereas it will decrease toward if the cluster is in 
part B. 

In a sense, this clustering behavior is good news because it is a direct proof of the 
existence of an inter-individual attraction. On the other hand, however, it introduces 
a bias in the measurement of r. A correction procedure was introduced to take this 

can greatly improve the accuracy of the measurement by increasing the number of pictures. In our experiments, depending 
on the activity and number of insects S was between 10s to 120s. 
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Mean correlation (xlOO): -1.29 
Clustering: no 
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Beetle (Tenebrium molitor) 
Mean correlation (xlOO): 1.47 
Clustering: yes 
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Fig. 2a,b,c: Relation between the variance of the number of individuals in a compartment A and the size 
of the group. We suppose that the whole domain which contains the ants has been divided into two parts and we 
observe the fluctuations riA(t) of the number of individuals in part A. The slope of the regression line gives an 
estimate of the mean correlation between the moves of individual elements. The negative correlation observed 
for drosophila can be interpreted as the result of repulsive forces between individuals. The ability to form 
clusters can be seen as revealing the existence of attractive inter-individual forces. Thus, this characteristics 
comes as a confirmation of the sign of the correlation. 

The confidence intervals (at a probability level of 0.90) are as follows: ants: lOOr = 3.37 ± 0.9, drosophila: 
lOOr = —1.29 ± 1.16, for the beetle graph there are too few points to compute the confidence interval. 
These experiments were performed between June and October 2012 in three different places, first in Paris (ants 
and drosophila), then in Beijing (drosophila) and finally in Kunming, Yunnan Province, China (beetles). 



effect into account. 

There is a problem which arises especially for drosophila, namely the fact that once 
introduced in the observation device only a few of the insects will move. In the case 
of drosophila this may take the following form: in a group of some 50 only about 
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5 to 10 will move at one moment and they will do so with great speed going from 
one end of the container to the other without seemingly caring about the 45 others0. 
Another circumstance which will prevent the insects from moving is when they form 
a cluster. Although a correction can be introduced in the analysis to take into account 
such "frozen" elements, it is clear that the analysis eventually becomes meaningless 
when the proportion of frozen elements is too high. 



The formation of clusters also leads to a more 
practical difficulty namely the fact that once 
ants are part of a cluster their spatial density be- 
comes so high that it is difficult to count them. 
As they form several layers, counting becomes 
nearly impossible even on high resolution pic- 
tures. Recently, we have tried an alternative 
method which consists in weighing rather than 
counting. This method is well suited for small 
beetles whose unit weight is of the order of 
15mg (see Fig 3). It is more difficult for ants 
whose typical weight (e.g. workers of "Formica 
japonica") is about 3mg. It is altogether im- 
possible for drosophila whose typical weight is 
around 0.2mg. 




Fig. 3: Container with weighing device on 
one side. The compartments A and B are slightly 
(0.5mm) disjoined along the blue and red lines re- 
spectively so that the weight measured by the scale 
corresponds only to the beetles contained in the left- 
hand side part but that the beetles can nevertheless 
cross from A to B and vice-versa. Here most of the 
beetles have formed a cluster in a comer. The weight 
is 357 mg which, when divided by 15 mg, gives a to- 
tal of 24 beetles. 



Consistency tests 

For a liquid inter-molecular attraction can be estimated through various means and 
variables: evaporation rate, equilibrium pressure of vapor, boiling temperature, heat 
of vaporization. It is the fact that such estimates are (at least most often) consistent 
with one another which gives us confidence in them. One would like to do the same 
here. 

A simple qualitative consistency test is provided by the following "evaporation" ex- 
periment. One takes a test tube containing some 50 drosophila and one makes them 
all move to the bottom of the tube by hitting the tube on a table. Then, very quickly0 
one puts the tube on the table in horizontal position. Let us assume that the bottom 
of the tube is on the left. After a few seconds, some 5 flies will have reached the 
right-hand side, and may be 10 others will be in the middle of the tube. If one waits 
5mn, the flies will be distributed fairly uniformly throughout the tube. 

If one repeats the same experiment with "Tenebrio molitor" beetles it will be seen 
that after 5mn almost all insects are still together on the left-hand side of the tube. 

ll) Whereas ants will tend to slow down or stop every time they come close to another ant. 
"This movement must be fast because drosophila have a natural tendency to go upward. 
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This experiment can be repeated in a more precise way by using the following pro- 
cedure. 
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Fig. 4a: "Evaporation" experiment with beetles. In the evaporation version of the experiment (top) the 
beetles move from the container into open space (i.e. the laboratory table) whereas in the equilibrium version 
they move from part 1 of the container into part 2 of same size. In the first case almost no beetles come back 
into the container just like the molecules in the evaporation of a liquid. The graph shows that the dropout rate 
decreases when the size of the population increases pointing to greater attraction power of larger groups. In 
physics similar effects can be observed. For instance the vapor pressure around dropplets of liquid decreases 
when the dropplets increase in size (Kelvin equation) and the melting point of gold particles increases with the 
diameter of the particles (Buffat and Borel 1976, p. 2294). 

For each value of n the experiment was repeated 10 times, which means that 80 experiments were performed 
altogether. For the 10 repetitions the coefficient of variation a/m was around 50%. The slopes of the regres- 
sion lines (with the numbers of beetles expressed in thousands) are as follows (the error bars correspond to a 
probability level of 0.90): 

evaporation: -2.8 ±0.5; 1 to 2, lOmn: -1.10 ±0.7; 1 to 2, 30mn: -2.2 ± 1.8; not in cluster: -1.3 ±0.8. The 
average slope is a = —2.0. The experiments were done in November 2012 by Ms. Mengying Feng and Shuying 
Lai from Beijing Normal University, Department of Systems Science. 

The experiment starts after a number n of beetles has been introduced into a con- 
tainer that we will call part 1. In the "evaporation rate" version of the experiment, 
the beetles can just walk out into open space. In the equilibrium version of the exper- 
iment the opening of part 1 leads to a container of same size. In this case, most often, 
the beetles formed a cluster both in part 1 and in part 2. However not all the beetles 
were in the clusters. This leads to the definition of two different variables: ri2(t), the 
number of beetles in part 2 at time t, and n^{t), the number of beetles which are not 
in a cluster. It is this latter variable which is the analog of the molecules in the vapor 
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Fig. 4b: "Not in cluster" experiment with bees. After formation of a bee cluster, the number of those outside 
of the cluster were counted. The duration of each experiment was comprised between one hour and one hour 
and a half. The three different colors correspond to slightly different experimental conditions. For instance, 
for the black data points there was a single cluster whereas for the red points two clusters formed. In the latter 
case we divided all numbers by 2. The slope of the regression line (also expressed per 1,000 bees) for the 
7 experiments, namely a = —0.55 ± 0.68, is 2.3 times smaller than the "not in cluster" slope in the beetles 
experiment. The experiments were done in June and July 2012 by Mrs. J. Darley and B. Roehner in Vol Fleury 
(western suburb of Paris). The bees were Appis Mellifera mellifera. 

phase over a liquid. The observations summarized in the figure show that whether in 
the non-equilibrium case of evaporation or in the quasi-equilibrium case, the escape 
rate decreases when the number of beetles increases. A natural interpretation is that 
the combined attraction of n beetles on one of them increases with ro . 



Conclusion 

In physics real progress occurs when there is a fruitful dialogue between theory and 
observation. This is currently one of the problems faced by string theory. There is 
a similar problem with computer simulations of social phenomena because of the 
fact that they rarely lead to testable predictions and when they do, most often, the 
requested statistical data turn out to be unavailable. Thus, there is almost no dialogue 
between theory and observation. This greatly hampers real progress. 

12 More precisely, one can say that the experiment displays two competing forces: (i) attraction and (ii) increased 
volatility. The increased volatility likely comes with the beetles' new environment. Indeed, when they are left alone for a 
long time they cluster together instead of occupying the whole available area, a typical liquid-like behavior. 
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For experiments on groups of insects the situation is completely different. Usually a 
model set up to account for a given phenomenon also leads to predictions for other 
phenomena. The nice feature is that most often the experiments required for explor- 
ing those phenomena can be designed and implemented within a few days. In terms 
of speed and convenience such experiments are very much alike computer simula- 
tions. In addition they allow a real dialogue between theory and observation. 

What kind of experiments should be tried by physicists? Clearly, it would be useless 
to repeat the experiments already done by entomologists. So far however, entomolo- 
gists have given only scant attention to the exploration of group effects. The experi- 
ments which come closest to those that we advocate are probably those conducted by 
the teams of Jean-Louis Deneubourg (a former physicist) in Brussels and Deborah 
Gordon at Stanford. However, because they confined themselves to the study of ants 
both Deneubourg and Gordon could not draw on the benefits and broader perspective 
that might come from comparative studies^.. 

Appendix B gives some practical hints for performing experiments with living organ- 
isms. We hope that this information will enable a number of other groups to carry 
out such experiments. This is a field where there is much to explore. For instance, 
some preliminary observations convinced us that the temperature^ plays a role in 
this kind of experiments which is fairly similar to what can be seen in chemistry 
and statistical physics. However, this must be confirmed and documented by a set of 
systematic experiments. 

Acknowledgments: For their helpful comments we would like to thank the re- 
searchers who attended a seminar given by one of us at the Department of Economics 
of the University of Tokyo on 28 November 2012. 

One of the experiments reported above was done in Kunming at the Eastern Bee 
Institute of Yunnan Agricultural University, China; we wish to express our appreci- 
ation to Prof. He and Tan for their hospitality and to Dr. Chen and Wang for their 
help. 

We also want to express our gratitude to Ms. Mengying Feng and Shuying Lai from 
the Department of Systems Science of Beijing Normal University whose results were 
mentioned in our discussion of the "evaporation" rate of beetles. 



13 A few centuries ago when physicists studied the phenomenon of "free" fall they did not confine themselves to falling 
apples. Indeed, comparative observation was the only way to demonstrate that, at least in air, the law is fairly independent 
of the shape and density of the falling object. This was a milestone in the development of classical mechanics. 

14 In a general way observation shows that the "temperature" of the living organisms (in the sense of statistical physics) 
is determined by the temperature of the environment. 
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Appendix A: Variance of a sum of correlated variables 

We proceed in several steps. 

For the sake of simplicity we first consider the average of a sum of three correlated random variables 
Xi, X 2 , X 3 of mean m and identical standard deviation a. Our objective is to compute the variance 

ofS 3 = X 1 +X 2 + X 3 . 

By definition of the variance a 2 (S 3 ) = E (S3 — E(S 3 )) 2 . One knows that the expectation of a 
sum of random variables is always equal to the sum of the expectations, whether the variables are 
correlated or not. Thus: E(S 3 ) = E(X X ) + E(X 2 ) + E(X 3 ). 
Consequently: 



E 



L i=l 



, where: Xi = Xi — E(Xi 



Thus, 



a 2 (S 3 ) = J2 E(Xf) + 2 \E(X 2 X 3 ) + E{X 3 X{) + E(X,X 2 



i=i 



We express the expectations of the products by introducing the coefficient of correlation of the Xi 

r y = EiX.X^/a 2 . Thus: a 2 (S 3 ) = 3a 2 + 2a 2 (r 23 + r 31 + r 12 ) 

From that point on, we will consider two cases. 
Uniform correlations 

Introducing the mean of the r^, r = [r 23 + r 3 \ + ri 2 )/3, we obtain: 

(7 2 (5 3 ) = 3a 2 [l + 2r] 

This formula has an obvious generalization to an arbitrary number n of random variables: 



a 2 (S n 



2 2 
na g , 
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(n — l)r + 1 



where: 



[n(n - l)/2 



Ising-like correlations 

For a one dimensional Ising spin system the correlation between spins i and j is: = where 
i] can be expressed (if one wish) as a function of the parameters which define the interaction between 
the spins (see Glauber 1963 p. 299, formulas (56) and (57)). 

Introducing this expression of gives: a 2 (S 3 ) = er 2 (3 + 2r] + rj 2 ) 

In extending this formula to any n, one needs to express the finite sum f{r]) = J2i=o ff ( as wei l as i ts 
derivative f'(i])). Instead of using the exact expression f(r/) = (1 — ?] n_1 )/(l — 77) we will consider 
that the term r/" -1 is negligible with respect to 1, which means that we approximate the finite sum by 
the corresponding infinite series. This approximation is acceptable for our experiments because most 
of the time n > 20. Of course the approximation is no longer valid when 77 — >- 1 but 77 = 1 is the case 
of uniform correlation already considered above. 



Under this assumption one obtains finally: 
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or: 

2 a 2 (S n ) 1 + 7] 2 V 

9 (n) = — = - 

na z l — r] n(l — 7]) z 

Due to the approximation made in the derivation, this formula is not valid when n is close to 1 . We 
have seen above that for n = 2, 3 one gets: 

g 2 (2) =l + 7j, g 2 (3) = 1 + (4/3)7/ + (2/3)r? 2 

which shows that the function g 2 (n) increases toward its asymptotic limit (1 + 77) / (1 — 77). 

Remark Can the Ising case be seen as a special instance of the uniform case? Formally, it may seem 
so. However, the real picture emerges when we consider large values of n. In the Ising case, due to 
the exponential decrease, all elements in the correlation matrix are almost equal to zero except for a 
zone around the first diagonal whose width depends only upon 7]. Consequently, for such a matrix the 
average correlation goes to zero when n becomes larger. 

This observation shows three things, (i) It would be irrelevant to treat the Ising case as a special 
instance of the uniform case, (ii) The fact that in the Ising case r ~ helps to explain that the ratio 
g 2 (n) remains basically constant instead of increasing, (iii) It explains why we used the expression 
"uniform correlations" to designate the first case. The correlations are uniform in the sense that when 
n — > 00 the number of elements of the correlation matrix that are "substantially" different from zero 
must remain of the same order of magnitude as n. For a distance-dependent correlation, this means 
that the decrease with distance must be slow enough. 

Simulations 

So far we did not need to make the assumption that the Xi are Bernoulli variables, that is to say 
variables taking only the values and 1 0. However, if one wishes to carry out a simulation there is a 
convenient algorithm which works only for Bernoulli variables (Lunn and Davies 1998). The relevant 
formulas can be summarized as follows: 

Simulation of uniform correlations between n Bernoulli variables Z and Yi are Ber(p) 
random variables while the Ui are Ber(y/r) random variables. Then, the variables Xi defined 
as: 

Xi = (1 - Ui)Yi + U t Z, i = l,...n 
are correlated Bernoulli variables with the following properties: 

E(X l )=p, E(X 2 )=p, Cov(X h Xj)=r, i^j 

It can be noted that this algorithm works only for positive correlations between the variables. 

Simulation of correlated Ising-like Bernoulli variables Yi are Ber(p) random variables 
while the Ui are Ber(r/) random variables. Then, the variables Xi defined as: 

X 1 = Y u X t = (l- U t )Y t + UiX^, 2<i<n 

are correlated Bernoulli variables with the following properties: 

E(Xi)=p, E(X 2 )=p, Cor(X i ,X j ) = riW, i^j 



When P{X = 1} = p such a variable will be noted as Ber(p). 
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Fig lb presents a simulation of the trajectories of ants. Such simulations are useful for testing the 
estimation procedure. How were they done? 

• First of all, in order to introduce a time-continuity which obviously exists in real experiments we 
generated the n random variables Z iit through n independent first-order auto-regressive processes. 

• Then, the correlations between the variables were introduced following the so-called Cholesky 
procedure by defining the X i;t as appropriate linear combinations of the Zj )t , j — 1, . . . n. 

Appendix B: Experimental "toolkit" 

Just in order to convince the reader that experiments with insects can be done fairly easily we give 
some practical hints. It is indeed possible to do this kind of experiments with fairly little sophisticated 
equipment. 

Basically, the needs can be summarized as follows: First one needs to get the living organisms. 

• Ants can be easily collected (at least in spring and summer) by putting appropriate food as a bait 
on a Bristol board just a few centimeters away from the entrance of a colony. Within one hour and 
depending on the species a few hundred ants may gather on the Bristol board. 

• Drosophila can be obtained from biology laboratories. 

• Flies and beetles can be bought in the form of larvae (worms) destined to fishermen or for 
feeding big aquarium- fishes. The waiting time between the larvae stage and the emergence of the 
adults ranges from less than one week to a few months depending on species, temperature and time 
of year. 

Secondly, in many cases, one needs a small bottle of carbon dioxide to make the insect sleep in order 
to be able to handle them easily. Carbon dioxide has an almost instantaneous anesthetic effect on 
all these insects. According to a paper published in the Journal of Experimental biology (Ribbands 
1950) anesthesia through carbon dioxide does not infer a memory loss and changes only slightly the 
behavior of bees. It is probably safe to assume that the effect on the other insects mentioned above is 
similar. 

Next one needs an appropriate container. A simple solution is to cut it into a piece of flexible plastic 
(such as PVC) of adequate thickness (3mm to 5mm is usually enough). This is illustrated in Fig. la. 

Finally, one needs a counting device. Taking pictures and counting by hand is a simple solution but 
not always satisfactory especially for counting the elements in a cluster. For this reason we have 
developed a weighing method (illustrated in Fig. 3). 

Clustering phenomena also occur among bacteria and micro-organisms that are present in so-called 
biofilms which form at the surface of liquids. Because of the small size and high numbers of such 
elements one is in a situation fairly similar to physical systems. For instance, it can be mentioned that 
inter-molecular forces such as van der Waals forces play a significant role in the movements of such 
micro-organisms. 

Studying the collective behavior of such populations from the perspective of physics seems a promis- 
ing field. However, in contrast to the study of insects, it requires special laboratory devices and 
equipment. 
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